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Abstract 

In this paper I present a parser based on De- 
scription Logics (dl) for a German HPSG-style 
fragment. The specified parser relies mainly 
on the inferential capabiHties of the underlying 
DL system. Given a preferential default ex- 
tension for DL disambiguation is achieved by 
choosing the parse containing a qualitatively 
minimal number of exceptions. 

1 Introduction 

in 

O^n this paper I present a parser for HPSG based on De- 
0\cription Logics (DL). The main motivation for specifying 
■'""Such a parser relies on considerations concerning the dis- 
ambiguation of NL expressions. In [Schmitz, Quantz 93] 
cot is shown how different types of ambiguity can be han- 
*~llled with a homogeneous approach based on the notion 
OQof preference rules [Jackendoff 83]. A major requirement 
■T-^or such a unified approach is that information usually rep- 
resented rather differently (e.g. syntactic, semantic, and 
^f^ncyclopedic information) is homogeneously represented 
f~jn a uniform and declarative formalism in order to express 
f^^nd evaluate the complex preferences stemming from the 
^—different kinds of information. 

Description Logics have been developed in the 
VTield of Knowledge Representation (see, for example, 
^[Brachman et al. 91]). They have already been used for the 
(^presentation of semantic and encyclopedic information, 
'-jt.g. [Allgayer et al. 89, Stock 91, Preuss et al. 92]. Due to 
0l^ eir similarity to typed feature formalisms [Carpenter 92], 

Syntactic information is in principle also expressible in DL, 
s already sketched in [Quantz 93, Quantz, Schmitz 94]. 
furthermore. Preferential Default Description Logics 
(PDDL) based on weighted defaults [Quantz, Ryan 93] can 
be used to represent the preference rules in a declarative 
and formally well-founded way. 

In the following I will mainly show how HPSG-style 
syntactic information can be represented in DL, and how a 
simple parser can be build by using the inference capabil- 
ities of a DL system. Note that when specifying the parser 
I will keep the presentation as simple as possible, thereby 
deliberately ignoring efficiency aspects. I will also refrain 
from modeling all aspects of relevant knowledge in dl, i.e. 
there are still pieces of information which are not explic- 
itly encoded in the DL modeling, but are rather implicitly 
contained in the parser (e.g. information related to linear 
precedence and to traces). 

The main objective of this paper it thus neither to con- 
tribute to research in efficient parsing technology, nor to 
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develop a declarative formalism in which all aspects rele- 
vant for NLP can be represented. It is rather to provide the 
basis for an implementation of the exception minimiza- 
tion approach to interpretation proposed in [Quantz 93]. 
In Section 6 1 will briefiy sketch how the DL-based parser 
presented in Section 5 can be extended to realize this ap- 
proach. 

2 Basic Ideas 

An important distinction made in DL, but missing in tradi- 
tional feature formalisms, is the one between objects and 
types. DL formulae either express that a type ti is more 
specific than (or subsumed by) a type t2 (ti : < t2) or that an 
object o is an instance of a type or, using dl terminology, 
a concept (o :: c). 

Applying this schema to the task of NLP, we can say that 
the objects in this domain are words or phrases, and that 
the types are syntactic categories. Furthermore, given 
a phrase oi we have additional relations between this 
phrase and its constituents 02,03, . . . , usually expressed 
as "02 is a daughter of Oi". In DL this is modeled as 
'(01,02) :: dtrs', or equivalently as 'oi :: dtrs\02 . 'dtrs' 
thus acts as a binary predicate or, using DL terminology, 
as a role. Note that roles can have more than one value 
in contrast to features, which are functional. We can thus 
write 'oi :: dtrs:o2 & dtrs'.oj,'. 

Note further that the objects stand for occurrences of 
words or phrases, and that different occurrences of the 
same word will be represented by different objects. This 
is represented by writing '02 :: phon:er\ for example, to 
express that 02 is an occurrence of the form 'er'. 

This is all rather similar to standard HPSG notation, and 
the main difference is that in addition to the feature struc- 
tures used in HPSG, we add an additional level of objects, 
which we see as instances of the feature structures. Fea- 
ture structures thus correspond to types or more precisely 
to DL concepts. In a way, the objects in dl are used to 
make the HPSG feature structures persistent, i.e. to have 
pointers or names to refer to them. 

The additional level of objects allows a straightforward 
description of the parsing task. We start with a number 
of objects, namely words, whose phonological value and 
position is known. We want to end up with a single object 
containing all these words as (not necessarily immediate) 
constituents. Now the immediate dominance schemata 
in an HPSG tell us how to construct phrases from words 
or other phrases. Thus the main operation for building a 
phrase is to create a new object being an instance of an ID 
schema (note that ID schemata are feature structures and 
therefore concepts) and to fill in the required daughters 
by using the objects available as building material. This 
is achieved by choosing the 'functor-daughter' and filling 
the required arguments. 



Three points are important in the following sections: 

1. Obviously, objects cannot be combined in a ran- 
dom way. In HPSG the ID schemata and the lexical 
entries contain information concerning combination 
with other phrases. I will model this information in 
DL and use standard DL inferences to check consis- 
tency of combinations. Thus the DL system is used 
to perform the unification task underlying HPSG and 
similar Unification Grammars. 

2. An object can only be used as building material for a 
phrase if it has not yet been used as building material 
for some other phrase. Furthermore, when looking 
for daughters of a new phrase, we have to fill those 
daughters for which a filler is required, but not yet 
specified. I will use the epistemic operator k pro- 
posed in [Donini et al. 92] to formalize these intu- 
itions and then use standard DL retrieval for checking 
these constraints. 

3. For syntactically ambiguous expressions there 
is more than one possibility to combine the 
words/phrases. Since the objects and especially 
the relations between them are viewed from differ- 
ent perspectives in the alternative interpretations, we 
need a mechanism in DL to represent these different 
views. I will use situated descriptions 'o :: c in s' 
in the following to formalize this notion of differ- 
ent perspectives. There is a rough correspondence 
between the situations used to capture the specific 
interpretations and the charts created in chart pars- 
ing. 

3 The Underlying Description Logic 

Description Logics vary wrt the term-building operators 
they contain. In this section I will present the syntax 
of the DL which is used in the examples given in the 
next two sections. Due to space limitations I will not 
specify the formal semantics for this dl (see, for exam- 
ple, [Hoppe et al. 93, Quantz, Schmitz 94] for a model- 
theoretic semantics): 

t ^ c,r,tp,ti &t2 ,k(t) 
c the(r,c) , some(r) , no(r) , exactly(n,r) 

r:o , ri = r2 

r feat, domain(c) , range(c) , v\.X2 , inv(r) 

7 ti :< t2 , ti := t2 , ci => C2 

o :: c in s , extend_sit(si,S2) 

When specifying the fragment and the parser in the next 
sections I will use a notation based on the prolog inter- 
face provided by the BACK system [Hoppe et al. 93]. In 
BACK a distinction is made between term introductions or 
definitions, and constraint-like rules. A term name can be 
introduced either as primitive (t„ :< t), i.e. only necessary 
conditions are given, or as defined (t„ := t), i.e. necessary 
and sufficient conditions are given. A rule Ci => C2 means 
that each object being an instance of Ci is also an instance 
of C2. 

The formula 'extend_sit(si,S2)' expresses the fact that 
situation S2 is an extension of situation si. This means 
that 'o :: c in si ' implies 'o :: c in S2' for all objects o and 
concepts c. 

In order to distinguish between telling and querying 
information I will use 'o :: c in s' for tells and 'o ?: in s' 
for queries. I furthermore assume that a tell only suceeds 



if it is consistent with the previously entered information; 
otherwise it fails. When the object used in a query is a 
variable, the system will retrieve all known instances of a 
concept, i.e. 'Object ?: in s' will return the objects known 
to be instances of 'c' in 's' by backtracking. 

Note that the epistemic operator k will only be used 
in queries. It can therefore be straightforwardly integrated 
into existing DL systems. Since this is also true for situated 
descriptions, the parser presented in Section 5 is largely 
based on standard irrference capabilities of dl systems. 

4 A Small Fragment 

In this section I will present examples from an HPSG-style 
fragment for German modeled in DL. Due to space lim- 
itations 1 will not specify all the information contained 
in this modeling but only the one needed to illustrate the 
main characteristics of the formalization and the example 
sentence 'Die schone Frau sieht sie' discussed in the next 
section. 

The fragment is based on the presentation in 
[Pollard, Sag 87] and its application to German in 
[Hiltl91]. A main difference between my DL modeling 
and standard hpsg modeling is that I avoid feature pathes 
which would introduce superfluous DL objects. There is 
thus no feature 'head' in my modeling since it would yield 
the introduction of head objects whose ontological sta- 
tus seems controversial. Consequently, my Head Fea- 
ture Principle specifies equivalence not for a single feature 
'head', but rather for each head feature separately. 

The fragment contains five main categories, namely 
noun, np, verb, det, and adj. For illustration, the definitions 
of noun and np are given below: 

noun := maj:n&lex:+ 
np := maj:n & lex:- 

Phrase structure is represented by roles as the following: 

dtrs :< domain(sign) & range(sign) 

comp_dtrs :< dtrs 

comp_dtrl :< compjltrs & feat 

headjitr :< dtrs & feat 

functor_dtr :< dtrs & feat 

The feature 'functor_dtr' will be used by the parser to spec- 
ify the sign acting as functor of a new phrase. Its value 
will be identical to the value of 'head_dtr', 'adj_dtr', or 
'filler _dtr', depending on the particular Immediate Dom- 
inance (id) schema used. Note that the daughters which 
are modeled as features are functional, i.e. no phrase can 
have two fillers for 'head_dtr' . 

Corresponding to these daughter roles and features we 
have argument roles and features as 'comp_argr etc. I 
then distinguish the following types of phrase structures: 

comp-structure := some(headjdtr) & 

functor_dtr=head_dtr & 
no(adj_dtr) & no(filler_dtr) 
adj-Structure := some(adjjdtr) & some(headjdtr) & 
functor_dtr=adj_dtr & 
no(compjdtrs) & no(filler_dtr) 

filler-Structure := some(filler_dtr) & some(headjdtr) & 
functor_dti^filler_dtr & 
no(comp Jtrs) & no(adj_dtr) 

Thus in a 'comp_structure' the 'head_dtr' acts as a functor. 
Note that it has to be explicitly stated whether a certain 
feature is empty, e.g. 'no(adj_dtr)' for 'comp_structure' . 
DL systems assume an open world and take all descriptions 



as being partial, i.e. the fact that there is currently no known 
filler for a role at an object does not imply that there will 
never be one. 

The fragment contains six ID schemata, namely three 
for noun phrases, one for verb phrases, one for adjuncts, 
and one for topicalization. 



idl 



id2 



id3 

id4 
ids 



id6 



comp-structure & 
the(head Jtr,np & nformxomm) & 
some(compjdtrl) & no(comp_args) 
comp_structure & 

the(headjdtr,noun & nformxomm) & 
no(compjJtrs) & some(comp_argl) 
comp_structure & the(headjdtr,verb) & 
no(comp_args) & mc:- 
adj_structure &. the(adj-dtr,adjunct) 
filler-Structure & 

the(headjdtr,maj:v Sl no(comp_args)) & 
mc:+ & the(filler_dtr,top:+) 
comp_structure & 
tlie(lieadjltr,noun & nform:pro) & 
no(compjltrs) & no(comp_args) 



For the lexical entries 1 will use three morpho-sytntactic 
features (nform, case, gen) to illustrate agreement between 
nouns, adjectives, and determiners. Agreement concern- 
ing case and gender between nouns and determiners is 
modeled by specifying that the value of the feature 'case' 
at a common noun is the same as the value of the feature 
'case' at the object filling the feature 'comp_argr (which 
is the determiner). 

Below are lexical entries for 'frau' and 'sie': 

noun & nformxormn => exactly(l,comp_args) & 

the(comp_argl,det) & 
case=comp.argl.case & 
gen=comp_arg 1 .gen) 
= > no(comp_args) 
=> noun (& nformxomm & gen:f 
= > lexeme:frau 
= > noun & nform:pro 
= > lexeme:er_sie & gen:f 

Note the hierarchical nature of the modeling — the sub- 
categorization information is specified for conomon nouns 
and pronouns in general, and is then inherited by each 
specific connmon noun and pronoun. Information shared 
by all forms of a lexeme is specified as a property of the 
lexeme, whereas information specific to a particular form 
is specified for this form only. 

Adjectives require non- saturated noun phrases as argu- 
ments and agree with them wrt case and gender: 



noun & nform:pro 
lexeme: frau 
phon:frau 
lexeme:er_sie 
phon:sie 



adj => adjunct & case=mod_argxase & 
gen=mod_arg.gen & 
the(mod-arg,np & some(comp_args)) 
adj 

lexeme:schoen 



= > 
= > 



lexeme:schoen 
phon:schoene 

Finally, the lexical entry for 'sieht': 

verb => the(comp_argl,np c& case:nom) 
lexeme:sehen => verb & exactly(2,comp_args) 
the(comp_arg2,np & case:acc) 
phon:sieht => lexeme:sehen 

Note that for verbs taking more than two arguments we 
need additional features 'comp_arg3' and 'comp_arg4'. 

In addition to the information modeled so far we need a 
formalization of the principles underlying the combination 
of signs in HPSG. Some of these principles hold only for 



phrases and not for signs in general. A phrase is defined 
as follows: 

phrase := some{dtrs) 
phrase => lex:- 

lex:- => phrase 

The Head Feature Principle is then defined as: 

phrase => maj=headJtr.maj &. 

gen=headjdtr.gen &. 
case=head-dtr.case 

The parsing process presented in the next section is essen- 
tially triggered by signs which can act as functors, namely 
signs with unsaturated subcat lists, signs with slashes, and 
pronouns: 

somc(args) => functor 
some(slash) => functor 
noun & nform:pro => functor 

5 DL-Based Parsing 

In this section I will present the basic structure of a DL- 

based parser for the above fragment. The parser is realized 
by five main predicates. I assume that the initial informa- 
tion given to the parser consists of descriptions of the 
words occurring in the expression to be parsed. Consider 
the ambiguous sentence 

(1) Die schone Frau sieht sie. 

(2) The pretty woman sees her. 

(3) The pretty woman she sees. 

The initial DL representation of this sentence is: 
wi :: phon:die & start:0 & end:l in si 

W5 :: phon:sie & start:4 & end:5 in si 

Given this information the parser builds phrases from the 
five words. This is done by creating new phrases until 
no more combinations of signs are possible. The parsing 
succeeds if the words have been all used up and a single 
phrase results: 

parse_sign(Sit,Sit) :- 

findall(Sign,Sign ?: sign & no(k(inv(dtrs))) in Sit,[-]). 
parse_sign(Sit,FinSit) :- 

new_phrase(Sit,NewSit), 

parse_sign(NewSit,FinSit). 

Note that the epistemic concept 'no(k(inv(dtrs)))' is used 
to determine whether a sign is still available for phrase 
building. An object is an instance of this concept if it is 
not a filler of some 'dtrs' role at any other object. 

The basic idea of building a new phrase is to look for 
a sign which can act as a functor, to choose an ID schema 
in which this sign is a functor, and to find the required 
arguments of the functor. Finally, the linear precedence 
rules are checked and, if necessary, traces are introduced.^ 

new_phrase(Sit,FinSit) :- 

Sign ?: functor & no(k(inv(dtrs))) in Sit, 
select Jd_schema(Sign,Sit,Phrase,NewSit), 
complete-arguments(Sign,NewSit,NextSit), 
checkJps-and-Continuity(Phrase,Sit,NextSit,FinSit). 

Selection of an ID schema is realized in a rather naive and 
simple way — we just take an id schema and try to create a 
new phrase as an instance of this schema, where the feature 
'functor Jtr' is filled by the functor. 



'Due to space limitations I do not specify the predicate 
'check Jps_andxontinuity' in this paper. 
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N (p7, id4) 
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frau (w3) 
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Figure 1: Phrase structure of the sentence 'Die schone Frau sieht sie' (The pretty woman sees her). 'H' stands for 
'head_dtr', 'C for 'comp_dtr', 'A' for 'adj.dtr', and 'F' for 'filler_dtr'. 



select Jd_schema(Sign,Sit,Phrase,NewSit) :- 
id_schema(ID), 
extend-sit(Sit,NewSit), 
Phrase :: ID & functor_dtr;Sign in NewSit. 

Information about existing ID schemata thus has to be 
encoded as facts of the form 'id_schema(idl)', etc. The 
predicate 'extend_sit(Sit,NewSit)' is used to tell the dl 
system to create a new situation which is an extension of 
the current situation. 

Note that no further knowledge about the actual mod- 
eling of ID schemata is used in the parser except for the 
fact that each id schema has a 'functor_dtr' . Note further 
that the DL tell will fail if the information known about the 
functor cannot be unified with the information required by 
the ID schema for the filler of 'functor Jtr' . 

In order to complete the arguments of the functor, the 
parser checks for each argument feature ArgFeat whether 
an argument is required (some(ArgFeat)) but not yet spec- 
ified (no(k( ArgFeat))). If so, 'find_arg' looks for such an 
argument and enters it as filler for ArgFeat. Then the 
remaining arguments are completed. 

complete_arguments(Func tor, Sit,FinSit) :- 
arg_feature(ArgFeat), 

Functor ?: some(ArgFeat) & no(k(ArgFeat)) in Sit, 
! ,flnd-arg{Functor, Sit, ArgFeat,NewSit), 
complete-arguments(Functor,NewSit,FinSit). 
complete-arguments(-,Sit,Sit). 

Again we need to introduce facts specifying the arguments 
used in the fragment, e.g. 'arg_feature(comp_argl)'. 

If an argument is required it has to be filled, therefore 
the Cut. Thus the recursion terminates successfully only 
when all required arguments are actually filled. Note that 
the only information about argument structure needed by 
the parser are facts of the form 'arg_feature(comp_argl)' 
for all argument features. 

To find an argument the parser looks for a sign which 
has not yet been used for phrase building and asserts it as 
filler for the argument feature. Again, if unification is not 
possible due to conflicting constraints (e.g. agreement), 
the DL tell will fail. 

find-arg(Functor,Sit,ArgFeat,FinSit) :- 
Arg ?: sign & no(k(inv(dtrs))) in Sit, 
extend-sit(Sit,FinSit), 



Functor :: ArgFeat: Arg in FinSit. 
find_arg(Functor,Sit,ArgFeat,FinSit) :- 
new4)hrase(Sit,NewSit), 
find-arg(Functor,Sit,ArgFeat,FinSit). 

The second clause is needed to create a required argument 
which has not yet been build up. In this case 'new_phrase' 
is called to create a new potential argument. 

For the sentence 'Die schone Frau sieht sie' we obtain 
two different parses, since both 'die schoene frau' and 
'sie' are ambiguous between nominative and accusative 
case. The reading according to which 'die schoene frau' 
is subject is shown in Figure 1 as a phrase structure tree. 
Some of the corresponding information contained in the 
DL situation representing this reading is given below: 

Wl :: phon:die & case:nom & start:0 & end:l 
P8 :: idl & headJtr:p7 & compJtrhwi & 

start:0 & end:3 
P9 :: id6 & headjdtriws & start:4 & end:5 
tio :: trace & tracing: W4 & comp_argl:tii & 

comp_arg2:p9 & start:5 & end:5 
til :: trace & tracing;p8 & start:4 & end:4 
pi2 :: id3 & headjltr:tio & 

compjdtrl:tii & compjltr2:p9 & 

slash:w4 & slash:p8 c& start:4 & end:5 
pi3 :: id5 & headjdtr:pi2 & filler_dtr:w4 & slash:p8 & 

start:3 & end:5 
pi4 :: id5 & headaitr:pi3 & filler_dtr:p8 & 

start:0 & end:5 

In the second parse tn and p9 swap places, i.e. pg is the 
'comp-dtrl' of pi2 and tn is the 'comp_dtr2'. 

The result of the parsing process illustrates the object- 
centeredness of dl representations. The constituents of 
the utterance are explicitly modeled and can be used 
for extracting or specifying further information. Thus 
we can choose to introduce a feature 'subject' and 
add the fact 'pi4 :: subjecttpg', or we can retrieve all 
the saturated noun phrases (Phrase ?: np & no(args)). 
This object-centeredness is useful for disambiguation, 
for example for anaphora resolution, as illustrated in 
[Quantz, Schmitz 94]. 



6 Interpretation as Exception Minimization 

I will now briefly sketch how the parser presented in 
the previous section can be extended to perform dis- 
ambiguation by exception minimization as proposed in 
[Quantz 93]. In case of ambiguous expressions the parser 
will return more than one situation. The basic idea of inter- 
pretation as exception minimization is to model additional 
preference rules needed for disambiguation as DL defaults, 
and to choose the interpretation violating a qualitatively 
minimal set of defaults. 

A Preferential Default Description Logic (PDDL) 
based on weigthed defaults has been developed in 
[Quantz, Ryan 93]. A weigthed default 6 has the form 
Ci C2, where C] is called the premise of 6 (Sp), C2 the 
conclusions of 6 (Sc) and n the weight of 6 {w{6)) — the 
higher the weight, the more relevant the default. For this 
PDDL a formally well-behaved preferential entailment re- 
lation l^j. is presented, which is based on an ordering 
on DL models '^d:- The basic idea of this preferential 
semantics is to compute a score for each model by sum- 
ming up the exceptions to the defaults. Models with lower 
score are then preferred because they qualitatively mini- 
mize the exceptions. It is straightforward to carry the idea 
of scoring and ordering over from models to situation. To 
do so, we compute for each situation s and each default 6 
the exceptions — those objects for which 'Object ?: 6p in 
s' succeeds and 'Object ?: 6c in s' fails. 

If there are several possible interpretations for an ex- 
pression we choose the interpretation given by the situa- 
tion with the lowest score. (Note that there may be truely 
ambiguous expressions which yield situations with iden- 
tical scores.) Thus taking the above example, we might 
use a preference for topicalization of subjects to prefer the 
parse shown in Figure 1. This can be achieved by simply 
introducing a default 

np & top:+ ^5 case:nom 

Obviously, this default is a rather weak one and can be 
overwritten by information stemming from selectional re- 
strictions [Schmitz, Quantz 93] . 

In principle, it is possible to use preferences stemming 
from weighted defaults already in the parsing process — 
situations whose score is higher than a specified threshold 
are not processed any further. Thus instead of producing 
all parses in the first step and ordering them in a second 
step, the parser would then only produce the preferred 
reading. 

7 Conclusion 

I have presented a DL-based parser for a small HPSG-style 
fragment of German. Most aspects of the grammar and 
the parser have been modeled in a highly declarative way. 
Since the main motivation for my presentation has been to 
show how an hpsg parser can be implemented in principle 
by using the inference functionality of a DL system, I have 
deliberately ignored any efficiency issues. It should be 
obvious, however, that the parser can be further optimized 
to increase its performance, for example by integrating 
chart parsing techniques. We are currently testing the 
performance of alternative implementations of both the 
parser and the underlying DL system. 

One advantage of using dl as underlying formalism is 
that in addition to the syntactic information modeled in 
this paper, semantic and encyclopedic information can be 



easily integrated into the presented framework. Further- 
more, Preferential Default Description Logics can be used 
to model preference rules as weighted defaults, thereby 
obtaining interpretation as exception minimization. The 
parser presented in this paper thus provides the basis for 
an homogeneous and formally well-founded approach to 
disambiguation based on Preferential Default Description 
Logics. 
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